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Abstract 

Let X = {X{t),t G [0, T]} be a second order random process of which n independent realizations are observed 
on a fixed grid of p time points. Under mild regularity assumptions on the sample paths of X, we show the 
asymptotic normality of suitable nonparametric estimators of the trend function ^ = EX in the space C([0,r]) 
as n,p ^ cxD and, using Gaussian process theory, we derive approximate simultaneous confidence bands for p. 

Resume 

Inference non parametrique d'une tendance avec des donnees fonctionnelles. Soit X={X{t),t G [0, T]} 
un processus aleatoire du second ordre dont on observe n realisations independantes sur une grille de p points 
deterministes. Sous de faibles conditions de regularite sur les trajectoires de X, nous prouvons la normalite 
asymptotique d'estimateurs non parametriques de la tendance /i = EX dans I'espace C([0, T]) lorsque n,p ^ oo, 
puis nous obtenons des bandes de confiance simultanees approchees pour /i a I'aide de la theorie des processus 
Gaussiens. 



1. Introduction 

In various application fields such as internet traffic monitoring, medical imagery, or signal processing, 
modern technology has allowed to collect data routinely from population samples with a high temporal 
and/or spatial resolution. Indeed, such datasets should be viewed as (collections of) curves or functions 
rather than as high-dimensional vectors; they are thus commonly termed functional data. (See [71|T3] for 
a comprehensive introduction to functional data analysis.) Typical functional data may be modeled as 
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observations of independent realizations Xi, . . . , Xn of a second order random process X = {X{t) , t G V} 
at fixed design points ti, . . . , ip, where V denotes a continuous temporal and/or spatial domain. In this 
framework, the observed data are 

r„ =X,(i,)+e„, l<t<n,l<j<p, (1) 

where the are mean zero random variables (r.v.) representing potential measurement errors. 

The trend function fi — EX often appears as a population mean response function, which motivates its 
inference. The nonparametric regression literature contains several results on the asymptotic properties 
of estimators of /i as the sample sizes n and p go to infinity. For instance when T? = [0, 1], mean-square 
convergence rates of kernel and spline estimators can be found in [2, ^ ^ i9] . When P is a compact metric 
space, [5] gives a universal consistency result as well as the asymptotic normality of all usual regression 
estimators in the sense of finite dimensional distributions and of the space L2 (2?) , with an application 
to simultaneous confidence intervals. The task of building (nonparametric and simultaneous) confidence 
bands for /z, which proves useful in various problems of prediction, model diagnostic, or calibration (e.g. 
[IldT]), has received considerable attention in the classical regression setting (e.g. [HITS]) but not, to 
our knowledge, for functional data. 

In this Note, we study the model ^ in the case where the random process X is indexed by a compact 
interval T) = [0, T] and has mildly regular sample paths. In Section 2, we state the asymptotic normality 
of suitable nonparametric estimators of fi in the space C([0, T]) of all continuous functions on [0,T] as 
n,p —* 00. In Section 3, we use the former results to build approximate simultaneous confidence bands 
for fi. Finally in Section 4, some potential applications and extensions of our results are discussed. 



2. Asymptotic normality of nonparametric estimators 

We state here the assumptions made on the random process X and on the model ([T]) of Section 1. 
(A.l) X is mean-square continuous onT) — [0, T]. 

(A. 2) The sample paths X{u!,-) are almost surely (a.s.) variation-bounded, with their total variation 

bounded by B{u}), where B is a r.v. with finite variance; or 
(A. 2') \X{llj,s) ~ X{LLj,t)\ < C\s — a.s. for some positive constants C and (3. 
(A.S) fj. has two bounded derivatives on [0, T]. 

(B.l) The data form a triangular array : Yij =Yij{n), tj =tj{n), and p = p(n), withp(n) ^ 00 as n ^ 00. 
(B.2) The random errors Sij are mutually independent and independent of the X^; they have mean zero 

and common variance cr^ > 0. 
(B.3) The tj are ordered (0 < ti < . . . < tp < T) and they have a quasi-uniform repartition, i.e., writing 

to = and U+i = T, it holds that '^<^^o<j<p{tj+i-t,) ^ 

Note that (A. 2') implies (A. 2). Also, (B.3) ensures that the tj are regularly spaced in [0,T]; it is fulfilled 
e.g. when the tj are equally spaced or are generated by a regular probability density function (p.d.f.). 

For each assumption (A. 2) and (A. 2'), we now introduce a suitable nonparametric estimator of /i and 
give its asymptotic distribution. Under (A. 2), we use the interpolation-type estimator of [4], denoted by 
fic, with a boundary correction. We recall here its definition. Let Yj = {J27=i ^y )/'^ 1 < j ^ Pi and 
let Y{t) be the process obtained by linear interpolation of the {tj, Yj) such that Y{t) — Yi ii t < ti and 
Y(t) =Yp if t > tp. The estimator 'fic is the convolution of a kernel function K with Y : 

licit) = -T ^- / Kh{t - u)Y{u)du, Kh{-) = K{-/h)/h. (2) 

jp Kh{t - u)du Jo 
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For convenience we take _ftr as a symmetric, compactly supported, Lipschitz-continuous p.d.f.. The real 
ft. > is a fixed bandwidth. Following [T^], we say that a sequence (Z„) of random elements of C{[0,T]) 
converges weakly to a limit Z in C([0, T]) if E(p(Z„) — > E(/3(Z) as n — > oo for all uniformly continuous 
functional ip on C([0,T]) equipped with the sup-norm. We denote by R the covariance function of the 
process X and by Q{0, C) any Gaussian process indexed by [0, T] with mean zero and covariance C. We 
are now in position to state the weak convergence of Jlc in C([0, T]) as n,p —^ oo (recall that p = p{n)). 

Theorem 2.1 Assume that (A.1),(A.2),(B.1)-(B.3) hold and that h — h{n,p) and pK^ oo as 
n,p oo. Then n}/'^{'jlc — E/ic') converges weakly to Q{0,R) in C([0, T]). If in addition (A. 3) holds 
and n = o{p), nh^ ^ as n^p — > oo, then n^^^('jj,c — '^^^o converges to Q(0,R) in C([0,T]). 

Next, we address the case where X satifies (A. 2') (Holder continuity). We consider the local linear 
estimator, denoted here by /Jl and defined by 

Mt) = Oo, (9„, di) = argmin(9^, H - - {tj - Khit, - t). (3) 

Theorem 2.2 Assume that (A.1),(A.2'),(B.1)-(B.3) hold and that h = h{n,p) and ph"^ —> oo as 
n,p — > oo. Then n^/'^['jlL — E/Ii) converges weakly to Q{0,R) in C{[0,T]). If in addition (A. 3) holds, 
n = o{p'^), and nh^ — > as n,p oo, then n^^'^ijlL — /i) also converges to Q{0,R) in C([0,T]). 
Remarks. 

(1) The proofs of the former theorems are similar and rely on the following steps: (i) note that the 
estimator is linear in the data; (ii) use the functional central limit theorem 10.6 of [12] for the 
estimator applied to the data without noise Xi{tj); (iii) show that under condition ph"^ — > oo, the 
estimator applied to the errors e.y becomes negligible in probability before n~^^'^ as n,p — > oo, 
uniformly over [0, T]; (iv) impose additional conditions on fi and on the joint rates of n,p and h to 
make the asymptotic bias of the estimator as 0(71^^/^) uniformly over [0,T] (in particular the rate 
n = o{p) used in Theorem 2.1 is only needed to control the boundary effects in the bias offic)- 

(2) Under (A. 2), we can prove asymptotic normality in C(0,T]) only for /Ic- This is because fie, as 
opposed to more classical estimators, has the remarkable feature of preserving monotonicity and 
thus satisfies (A. 2) like X, which makes the step (ii) of the theorem proof straightforward. On 
the other hand, under the stronger assumption (A. 2') the asymptotic normality in C(0,T]) can be 
obtained for various classical kernel or projection estimators, as well as for fie- The choice of fiL 
here was motivated by the popularity of this estimator and by its good bias properties. 

(3) The condition ph'^ 00 can be dropped in both theorems if there is no noise in the data {a — 0). 
Besides, the results carry over to the case of correlated errors, e.g. of autoregressive or mixing 
type. 

(4) Theorem 2.2 corrects a mistake in the Section 4 of [H] which gives (np/i)^/^ as the normalizing 
rate for the weak convergence of fiL{t) — E/2i(t). (The condition (C3*) {ph ^ as n ^ 00) of this 
paper does not produce a well-defined estimator for large n). On the other hand, our normalizing 
rate n^^"^ is consistent with the variance rate found in the literature. 



3. Simultaneous confidence bands 

We build here approximate simultaneous confidence bands for /i at the level 1—76 (0, 1). First 
assume that either Theorem 2.1 or 2.2 applies, i.e. that n^/^{fi — fi) converges weakly in C([0,T]), 
where fi denotes the corresponding estimator (2) or (3). Assume also that the covariance function R is 
non-degenerate and let R{t, t) be any uniformly consistent estimator of R{t, t) with respect to i G [0, T]. 
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With Slutsky, one sees that n^^^Cfinp — fJ,)/R{t, i)^/^ converges to Z = Q{0, p) in C([0, T]), where p is the 
correlation function of X. It suffices then to apply a classical result of [10^ to get that 

lim A^^logp/ sup Z{t)>)X=-(2 sup p{t,t)\ = . (4) 
lte[o,T] J V te[o,T] / 2 

Finally, use @ along with the inequality PjsupjgjoT] 1^(^)1 > -^l !i 2P{sup(g[Q -^j Z{t) > A} to derive 
the following approximate simultaneous confidence bands for fi : 

Ai(i)±(-21og(7/2)i?(i,t)/n)'^' (0<t<r). (5) 



4. Discussion 

The asymptotic normality results presented in this paper provide a new tool for making simultaneous 
inference on a trend function p, in the context of functional data. They can plausibly be extended to the 
framework of multivariate and/or vector- valued random processes and to the inference of derivatives of 
p. To the best of our knowledge, the simultaneous confidence band procedure of Section 3 is the only 
one appearing in the literature for functional data. Its implementation only requires the estimation of 
the mean and of the variance of X (not the whole covariance R) and some simulations have indicated its 
good performances in terms of empirical coverage probability. It would benefit from additional features 
such as bias correction or data-driven bandwidth selection. The asymptotic normality results may also 
be applied to constructing tests for p. A linearity test based on these results is currently under study. 
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